Cross-validation estimations of hyper-parameters of Gaussian processes with inequality constraints

نویسندگان

  • Hassan Maatouk
  • Olivier Roustant
  • Yann Richet
چکیده

In many situations physical systems may be known to satisfy inequality constraints with respect to some or all input parameters. When building a surrogate model of this system (like in the framework of computer experiments), one should integrate such expert knowledge inside the emulator structure. We proposed a new methodology to incorporate both equality conditions and inequality constraints into a Gaussian process emulator such that all conditional simulations satisfy the inequality constraints in the whole domain. An estimator called mode (maximum a posteriori) is calculated and satisfies the inequality constraints. Herein we focus on the estimation of covariance hyper-parameters and cross validation methods. We prove that these methods are suited to inequality constraints. Applied to real data in two dimensions, the numerical results show that the Leave-One-Out mean square error criterion using the mode is more efficient than the usual (unconstrained) Kriging mean. © 2015 The Authors. Published by Elsevier B.V. Peer-review under responsibility of Spatial Statistics 2015: Emerging Patterns committee.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classification model selection via bilevel programming

Support vector machines and related classification models require the solution of convex optimization problems that have one or more regularization hyper-parameters. Typically, the hyper-parameters are selected to minimize cross validated estimates of the out-of-sample classification error of the model. This cross-validation optimization problem can be formulated as a bilevel program in which t...

متن کامل

Performance-Estimation Properties of Cross-Validation-Based Protocols with Simultaneous Hyper-Parameter Optimization

In a typical supervised data analysis task, one needs to perform the following two tasks: (a) select the best combination of learning methods (e.g., for variable selection and classifier) and tune their hyper-parameters (e.g., K in K-NN), also called model selection, and (b) provide an estimate of the performance of the final, reported model. Combining the two tasks is not trivial because when ...

متن کامل

A revisit of a mathematical model for solving fully fuzzy linear programming problem with trapezoidal fuzzy numbers

In this paper fully fuzzy linear programming (FFLP) problem with both equality and inequality constraints is considered where all the parameters and decision variables are represented by non-negative trapezoidal fuzzy numbers. According to the current approach, the FFLP problem with equality constraints first is converted into a multi–objective linear programming (MOLP) problem with crisp const...

متن کامل

A Generic Path Algorithm for Regularized Statistical Estimation.

Regularization is widely used in statistics and machine learning to prevent overfitting and gear solution towards prior information. In general, a regularized estimation problem minimizes the sum of a loss function and a penalty term. The penalty term is usually weighted by a tuning parameter and encourages certain constraints on the parameters to be estimated. Particular choices of constraints...

متن کامل

Spatial Decompositions for Large Scale SVMs

Although support vector machines (SVMs) are theoretically well understood, their underlying optimization problem becomes very expensive if, for example, hundreds of thousands of samples and a non-linear kernel are considered. Several approaches have been proposed in the past to address this serious limitation. In this work we investigate a decomposition strategy that learns on small, spatially ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017